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For: Duplex Probes for Hybridization 

Reactions 

DECLAR,A.TION UNDER 37CFR1.132 
I, Mark A. Reynolds, declare and state as folio-vvs: 

1 . I am the Senior Director of Assay Research at Incyte Genomics, Inc., the assignee 
of the subject patent application. I hold a Ph.D. in pharmaceutical chemistry and have over 17 
years industrial experience in nucleic acid research, and am knowledgeable and experienced in 
the field of raicroarrays. I have read and am familiar with the contents of the above application 
and the office action mailed 9/13/00, 



polynucleotide target stably associated wth a surface of a solid support, wherein one of the probe 
and target is double-stranded comprising complementar)- strands, and the other is single-stranded 
having complementarity with one of tlie complementar\' strands, under conditions wherein the 
probe and tai-get hybridize and the probe is thereby immobilized. Every aspect of this claimed 
method is amply described, including methods for making the probes (p.8, line 12 - p.9, line 1 0) 
and targets (p.6, line 19 - p.7, line 22); methods for stably associating the targets to a solid 
support surface (p,7, line 23 - p.8, line 1 1) and conditions for hybridizing the probe to the 
immobilized target (p.lO, lines 5-8). In addition, the entire method is repeatedly exemplified in 
experimental detail (p.l2, line 5 -p.l8, line 10). 

Claim 3 is the method of claim 1 further comprising the step of releasing the immobilized 
probe. This is trivial - the additional step merely requires subjecting the probe to conditions 
whereby it would be released from the immobilized target. Conditions for hybridizing and 
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Claim 1 requires only a single step: combining a polynucleotide probe with a 



11/22/2080 15:40 



6503434342 



SCIENCE^tTECHNOLOGY 



PAGE 09 




t 



dehybridizing polynucleotides are routine^ well-known in the art and involve no more than 
adjusting temperature and/or salt concentrations (e.g. Maniatis et al., cited on p.lO, line 6; see 
also p. 17, line 1 1). The specification readily enables those skilled in the art to hybridize the 
probe to the immobilized target; with or without our specification, they are readily able to 
dehybridize ii. 

Claim 4 is the method of claim 1 furtlier comprising the step of amplifying the 
immobilized probe. Methods for amplifying a given polynucleotide are routine and well- 
established (e.g, Maniatis et al., cited on p.8, line 22); amplification is in fact how the probes are 
generally made (p.8, line 28) - the targets too (as amplified PGR elements, p. 17, lines 4-5). 

Claim 5 is the method of claim 1 further comprising the step of amplifying the 
immobilized probe and then detecting resultant amplified probe. The additional step (over claim 
4) of detecting the amplified probe is routine: methods for labeling amplified probes are 
described (p. 8, lines 27-28) as are methods for detecting them (p. 10, lines 8-11). 

Claim 6 is the method of claim 1 further comprising the steps of releasing and amplif>'ing 
the probe to produce a labeled, amplified double-stranded probe, hybridizing the labeled probe 
to a target polynucleotide to immobilize the labeled probe and detecting the labeled probe. The 
additional step (over claims 3 and 5) of hybridizing the probe to an immobilized target 
polynucleotide requires no more than repeating the step of claim L Hybridizing probes to 
immobilized targets is thoroughly described and exemplified (see e.g. Examples 1 and 2). 

Claim 7 is the method of claim 1 further comprising the steps of releasing and amplifying 
the probe to produce a labeled, amplified double-stranded probe, hybridizing the labeled probe 
to a target polynucleotide to immobilize the labeled probe and detecting the labeled probe, 
wherein the probe is double-stranded and target is single-stranded. The additional limitation 
(over claim 7) is the requirement that the probe is double-stranded and target is single-stranded, 
as exemplified in Examples 3-10* 

How^ to isolate or release an immobilized probe, how to amplify such probe and how to 
detect an amplified probe are all trivial tasks to those skilled in the art of solid phase 
hybridization assays* In my expert opinion, the teachings and exemplification of the 
specification fully enable one of ordinary skill in the art to practice the methods recited in the 
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claims without undue experimentation. In particular, the specification enables one of ordinary 
skill in the ait. without requiring any undue experimentation, to release and amplifs' the recited 
immobilized probe, to label the resultant amplified probe, and to then hybridize the labeled probe 
to a target polynucleotide to immobilize the labeled probe and to detect the labeled probe. 

I hereby declare that all statements made herein of my o^^'n knowledge are true and that 
all statements made on information and belief are believed to be true: and further that these 
starements are made with the knowledge that willftil false statements ^d the like so made are 
punishable by fme or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such mllM, false statements may jeopardize the validity of the application and 
any patent issuing tlierefrom. 



Date: Nov 22, 2000 



iMark A. ReynoIds6Ph.D. 
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Parallel human genome analysis: Microarray-based expression 
monitoring of 1000 genes 

(Human Genome Projret/DNA cliip/gene discovery/T Cell) 

Makk SCHENA-t. Dar. Sha»n«, Renu Heobr". andrct Cha,'. Patkick O. BbownS, am. Ronau, W. Davis- 

Contributed by Ronald IV. Davis, June 26. 1996 

ABSTRACT Microarrays ct^ntaining 1046 human cDNAs 
of unknown sequence were printed on glass with high-speed 
roboUcs. nim LO-cm^ DNA -chips" were used to quaniita- 
Uvely monitor differential c3<pression of the cognate human 
genes using a highly sensitive two-color hybridl^tion assay. 
Array elemenu that displayed differential exprcsaion patterns 
under given experimental conditions were characterized by 
sequencing. The Identification of known and novel hear shock 
and phorbol ester^regulated genes in human T cells demon- 
strates the sensitivity of the assay. Parallel gene analysis with 
microarrays provides a rapid and efficient method for large- 
scale human gene discovery. 



Biology has entered the genome lira (1). Complete genome 
sequences for all of the model organisms and human will 
probably be available by the year 2003 (2). Torrents of human 
expressed sequence tags (ESTs) provide a starting point for 
elucidahng the function of tens of thousands of cognate genes 
(3). Genome analysis vvill piwide insights into growth, devel- 
opment, differentiation, homeostasis, aging, and the onset of 
diseases (1-3). A deified understanding of the human gcnoxne 
wili require the iujplenicntatiou of sophisticated methods for 
gene expression analysis and gene discovery. 

Recently, a OJicroarray-based method for high-diroughput 
monitoring of plajit gene expression was described (4). This 
"chip'-bajied approach involved using microarrays of cDNA 
clones as gene-specific hybridization targets to quantitativelv 
measure expression of the corresponding plant genes (4, 5). A 
two-color fluorescence labeling und detection scheme facili- 
tated sensitive differential exprwioii analysis of different 
plant tissues (4, 5), The efficiency of this ^pp^oach for studies 
in higher plants suggested the use of this method for human 
genome analysis (4-7). Here, we report the use of cDNA 
microarrays for human gene expression monitoring, biological 
investigation, and gene discovery. 

MATERIALS AND METHODS 

Human cDNA Clones. The cDNA library was made with 
mRNA from human peripheral blood Ivmphocytes trans- 
formed with the Epstein-Oarr virus. Inserts >ftOO bp vvere 
cloned into the lambda vector WHS-R to generate lO'^-lO^ 
recombinants. Bacterial trans form an us were obtained bv in- 
fecung £, coil strain JM1I)7/aKC. Colonies were picked at 
random and propagated in u %«well format, and miniiysatc 
DNA was prepared by alV^linc lysis using RE\L preps 
(Oiagen. ChaLsworth, CA). Instiris were amplified by PCR it^ 
a y5-well formal using primers (PANI32. 5'-CCTC- 
TATACTTTTaaCGTCaaGG. und PAN 1 33. f^TTGTGTG- 
GAATTGTCACCGG) ^.oniplcmontary lo the AYFiS 
polyhnUer und containing i\ MX-carbon iirnino modification 

The pubhcuuuii cu5ii 01 tl^ti jfticjiy v.crc d^ifMivcc m nurt hv pain: cliLfLit: 
pjymcni This ariicJc muii( uicrelurt l-c hcrchv'mi.rKt; j ■ ii.hYr/tii'tn^nr -u 
.ICCOrO^ncL wi(h I^i U.S.C ? I 734 viloiv [o miiici^iu eMk fact. 



(GJen Research, Sterling, VA) on the 5' end. PGR products 
were purified in a 96-well format using QiAquick columns 
(Qiagen). 

Microarray Preparation, Amino-modified PGR products 
were suspended at a concentration of nig/ml in 3x 
standard saline citrate (SSC) and arrayed from 96-wcll micro- 
titer plates onto siiylated microscope slides (GEL Associates, 
Houston) using high-speed robotics (4-7). A total of 1056 
cDNAfi* representing £046 human clones and iO ArabidopsLs 
controls, were arrayed in l.O-cm^ areas. Printed arrays were 
incubated for 4 hr in a humid chamber to aJ)ow rehydration of 
the array elements and rinsed, once in 0.2% SDS for I min, 
twice in H2O for I min, and once for 5 min hi sodium 
borohydride solution (LO g of NaBH4 dissolved in 300 ml of 
PBS and 100 ml of 100% ethanol). The arrays were submerged 
m H2O for 2 min at 95''C, transferred quickly into 0.2% SDS 
for 1 min, rinsed twice in H^O. air dried, and stored in the dark 
at ZS'G. 

Fluorescent Probes. Tissue mRNAs were purchased 
(CLONTECH). Jurkat mRNA was isolated as described by 
Schena et aL (4). Probes were made as described (4) with 
several modifications. The reverse transcriptase used here was 
Superscript 11 RNase H- (GIBCO). The C^-i-dOTP was 
purchased from Amcrsham. Each reverse uranscription reac- 
tion contained 3.0 ^g of total human mRNA. Arabidopsis 
control mRNAs were made by viiro transcription of cloned 
H AT4, HAT22, and YesAt-23 cDNAs (4, 8. 9) using an RNa 
Transcription Kit (Stratagene), For quantitation, the mRNAs 
were doped into the reverse transcription reaction at ratios of 
1:100,000. 1:10.000. and I:IOOO(wt/wt) respectively. Following 
the reverse transcription step, samples were treated with 2.5 ^[ 
Of 1 M sodium hydroxide for 10 min at 37X. then neutralized 
by adding 2,5 of 1 M Tris-HCI (pH 6-8) and 2.0 /il of I M 
HQ. Probe mixtures contained cDNA products derived from 
3 Mg of total mRNA, su.«ipended in 5,0 mI of hybridization 
buffet- (5x SSC plus 0.2% SDS). 

Hybridization and Scanning* Probes were hybridized to 
l-O-cm- microarrays under a 14 X 14 mm glass coverslip for 
6-12 hr at fiO'C in a custom-built hybridization chounber (4-7). 
-Arrays were washed for 5 min at room temperature (25^C) in 
low stringency wash buffer (1 x SSG/O.2'% SDS), then for 10 
min at room temperature in high stringency wash buffer (O.ix 
SSC/O-Z-^c SDS). Arrays were scanned in O.lx SSC using a 
fluorescence laser scanning device (4-7), fitted with a custom 
filter ?:ct f Chroma Technology, Brutileboro. VT). Accurate 
differential expression measurements (i.e., final fluorescence 
ratios) were obtained by taking the average of the ratios of two 
independent hybridizations. 

Ahbrcviation: CST. ^JTjpresscO sequence ug. 

Data dcno.<iition: The sequences icponcd in this paper h;ivc been 

•To whom reprint rcquc5t.s sbouia f^a addresseiJ. email- ^chviv^ro 
cm cm .sianfoiM.edu. 
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Biochemistry: Schena et al. 

CelJ Culture. JurJ^at cells were ^rown in a tissue culture 
'''^'i!'fnl' (^7°C and 5% CO2) in RPMI medium supplemented 

.ci/'' ^^^^ ^^^""'^ streptomycin per m! 

and 500 uaUs of penicillin per ml Heat shock corresponded to 
a 4-hr incubaiion at 43°C. Phorbol ester treated ceils were 
grown (or 4 hr in the presence of 50 ng of phorbol 12^£nyri5taie 
13-acetaie (PMA) per ml. ^ 

RNA BlottJng. Dot blots were performed as described (4) 

DNA Sequencing. Sequences were obtained usinc the 
PAN132 and PAN133 prinjers and a 373A automated se- 
quencer, according to the instructions of the manufacturer 
(Applied Biosystems), 

Compuier Graphics and Informatics. Pseudocolor represen- 
tations of fluorescent images were made with National Institutes 
ofHeaith iMAGEsoftware (version 1.52). Software for differemiaj 
expression representations was purchased from Imaiting Re- 
search (St Cathenne^s. ON, Canada). Sequence searches were 
oiade to the nonredundam nucleotide data base at the National 
Center for Biotechnology Information (NCBI) using Macintosh 

^^'^ ^^^^^ ti^e Worid 

Wide Web (http:/www.ncbi-nlm.nih.gov/). 

RESULTS 

Gene Discovery and the Heat Shock Response. Microarrays 
were used to examine the heat shock response in cultured 
^ (^''^^''^) ^^ells. Control (37»C) and heat-treated 
(43 C) cells were harvested and lysed, and total mRNA from 
the two ceil samples was labeled by reverse transcriptase 
incorporation of fluorescein- and Cy5-dCrP. respectively. In 
a second set of labeling reactions, the fluorescent groups were 
"swapped^* such that samples from control and heat-treated 

-Heat Shock 



Five. NaU. Acad, ScL USA 93 (1996) 10615 

samples were labeled with Cy5- and fluorescein-dCTP. respec- 
tively. Each pair of f luorescem probes was hybridized to a 
I056^iement raicroarray. The arrays were washed at high 
stringency and scanned with a confocaJ laser scanning device 
to detect emission of the two fluorescent groups. 

Hybridization signaJs were observed to >9S% of the human 
cDNA array elements, but not 10 any of the Arabidopsis 
negative controls (Fig, 1). Fluorescence intensities spanned 
more than three orders of magnitude for the 1046 array 
elements surveyed (Fig. 1). Comparative expression analysis of 
heat shocked versus control cells in the two experiments 
reveaied 17 array elccaents that displayed altered fluorescence 
ratios of a2.0.fold (Figs. 1 and TA). Of the 17 putative 
differentially expressed genes, 11 were induced by heat shock 
treatment and 6 displayed modest repression (Figs. 1 and lA) 
To determine the identity of the heat- regulated genes 
cDNAs corresponding to each of the 17 array elements were 
sequenced on the proximal and distal end. Data base searches 
revealed perfect matches for 14 of the 17 clones, and in each 
case proximal and distal cDNA sequences mapped to the same 
gene (Table i). Of the 1046 human genes examined on the 
microarray the five most highly induced in heat-treated cells 
were heel shock protein 90a (hsp90a), dnaJ. hsp9O0, polyu- 
biquitm» and t-complex polypeptide- 1 (tcp-1) (Table 1). Three 
of the 17 clones did not match any entry in the public data base 
though one of the clones (B7) exhibited significant homology 
to an EST from CaenotkabdUis elegant (Table I). Each of the 
novel sequences (B7.B9) exhibited =»^2-fold induction (Table 1 ) 
and relatively low-level expression (Table 2). 

To confirm the microarray results. mRNA levels for each of 
the genes were measured by RNA blotting. Each of the genes 
that displayed heat shock induction, including the three novel 
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MiccDAffay elements corresponding to diffcrcntiaUy expressed genes 
Row 



Coluum 



Ratio 



B2 

B3 

B4 

BS 

B6 

BJ" 

BS 

39 

BIO 

Bli 

B12 

B13 

Bl4 

B15 

BI6 

B17 

BIS 

B|9 

B20 

B21 

B22 

B2J 



24 



15 



17 
22 



14 

7 

28 
14 
20 
30 
10 
13 



21 



22 
20 



21 
31 

31 
19 



Blast identity 



Accession no. 



16 

10 
30 
26 
IK 
30 
16 



0,5 

0.5 

0.5 

0.5 

0.5 

0,5 

2.0 

2.0 

2,2 

2.4 

2.4 

2.5 

2,5 

2.6 

4.0 

5.8 

6.3 

10 

24 

2.2 

2.6 

3.5 

19 



CYC oxidase HT 
0-Aciin 

CYC oxidiisc JTI 
CYC oxidase III 
CYC 0xida.« HI 

Novell 

Novel? 

Novei+ 

Folyubiquitin 

TCM 

PolyubiquUin 
Pulvubiquitin 

DnaJ homolog 

HSP90a 

HSP90a 

iJi-niicroglobulin 
Novell 

JEJl^microglobuIin 
PQK 

PAC-I 



J0Ut5, J01415 
NR, X00351 
J01415, J01415 
J0X4i5. J01415 
J01415* J01415 
NR, X0035I 
U56653, U56654 
U56655, U56656 
U56657, U56658 
X04803, X04803 
X52382, X52882 
Ml 7597, Ml 7597 
X04803. X04803 
Ml 6660. M 16660 
D13388, D13388 
X07270, X07270 
M27024. XJ51B3 
554761. M30683 
U56659> U56660 
554761. M30683 
Mil 968, UX)16U 
Z47744. M55643 
LI 1329, LI 1329 



_, A-iij-Tf, jk^nj_;v 

;B7 t3 o7f"c identical to an 5ST from C ck^atis tD-c/::hi 
\No match m the pul>lic data iiascs. 



11/22/2000 15:40 6503434342 



SCIENCE&TECHNOLOGY 



PAGE 14 



Biochemistry: Schena et al d i^r . . . « 

Proc, Nail Acad, ScL US.i 93 (1996) mn 

Tabic 2. Human gene expression monitored by microarray and RNA bloi analyses 
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Blast identity 



&yprefij:ion level, per 1(K* niRNAs 
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B3 CYC oxidaiie t!l 

B4 CYC oxidase HI 

B5 C\'C oxidase m 

B6 i^-Actm 

87 Novel (weakly to D76026) 

B8 Nov'el 

B9 Novel 

BlO Poiyubiquitin 

Btl TCP-i 

Bl 2 Poiyubtquitin 

B13 Polyubiquitin 

B14 HS1>90B 

B15 DnaJ horaolog 

B16 HSP90a 

018 Prraicroglobulin 

819 Novel 

820 ^^tcroglobuJin 

821 PliOSphoglyceratt kinase 
B22 NF-KBl 

823 PAC-1 
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sequences, exhibited elevated mRNA levels by dot blot analysis 
(Table 2), la all cases* expression ratios as determined bv chc 
two procedures differed by <2-fold for the genes idemified in 
the heat shock experiments (Table 2). The two assays differed 
more widely in terms of assessing absolute expression levels; 
noneiheless, absolute expression as monitored on a microarray 
typically correlated with RNA blots to within a factor of five 
(Tabic 2). 

Phortwi Ester Signaling, To explore a signaling pathway 
distinct from the heat $hock response, microarrays were used 
to examine the cellular effects of phorbol ester treatment, 
Jurkat cells were treated with phorbol ester, harvested, Ivsed 
and used as a source of mRNA. Samples of mRNA from' 
untreated or phorbol ester-stimulated cells were labeled with 
reverse transcriptase. The probes were mixed, hybridized to 
microarrays, and scanned for fluorescence emission of the [wo 
fluorescent groups. A total of six array elements dfsplaved 
s2.0-foid elevated signals with probes from phorbol ester- 
treated cells relative to control samples (Fig. 2B), 

To determine the identity of the phorbol ester-induced 
genes, clones corresponding to the sijc array elements were 
sequenced. Data base searches revealed perfect matches for 
five of the six sequences (Table I). The two most hichlv 
induced genes were the FAC-I tyrosine phosphatase and 
nuclear factor-kappa 31 (NF-kBI); modest activation was 
observed for phosphoglyceraie kinase and ft-microgiobuiin 
( Ubic 1). One remaining clone (Bl9) did not match anv entrv 
in the pubhc data base (Table I). B19 displayed a 2 Mold 
induction and, similar to the novel heat shock nenes. a reia- 
twcly low absolute expression level (Tables I and 2). All ^ixof 
the phorbol csterMnducible genes displayed increased .stcadv- 
statc mRNA Icvel.-s by RNA blotting (Table 2). PAC-l expres- 
sion (Fig. I; Table 2) defined a detection \imii of 1:500 -"^0 
for the assay. 

Transcript Imaging fn Human Tissues. To determine 
Whether microarrays could be used to monitor c:^prossio-. in 
human tissues, probes were prepared from human bone r:,ar- 



row. brain, prostate, and heart by labeling each mRNA sample 
with CyS-dCTP. In a separate reaction, a control probe was 
prepared by labeling Jurkat mRNA with fluoresce in-dCTP. 
Ilie four Cy5-label€d probes were each mixed with an aliquot 
of the fluorescein-labeled control sample, and the four mix- 
tures were hybridized to separate microarrays. The arrays were 
washed and scanned for fluorescence emission, and hybrid- 
ization signals for each of the tissues samples were normalized 
to the Jurkat control to generate an expression profile for each 
of the 1046 clones present on the array. 

Detectable expression was observed' for all 15 of the heat 
shock and phorbol ester- regulated genes in the four tissue 
types examined (Fig. 3). In general, the expression level of each 
gene in J urkat cells correlated rather closely with expression in 
the four tissues (Table 2; Fig. 3). Genes encoding /S^actin and 
cytochrome c oxidase, the two most highlv expressed of the 15 
genes in Jurkat cells (Table 2). were highlv expressed in bone 
marrow, brain, pro$tate. and heart (Fig. '3/4). Expression of 
cytochrome c oxidase. hsp90a, and the novei B7 sequence was 
signidcanily greater in heart than in the other tissues (Fig. 3), 

DISCUSSION 
Many of the heat shock genes identified in this studv encode 
factors that function either as molecular '^chaperones" 
(HSP90a, HSP9O0, DnaJ. TCP-1) or as mediators of protein 
degradation (polyubiquitin). The ideaiificalion of these se- 
quences is consistent with the biochemical basis of heat shock 
mduciion (10-15). Proteins undergo denatu ration at elevated 
temperatures, and those that fail to maintain proper confor- 
mation must be selectively degraded (10-15). It will be inier- 
esiing to determine whether the three novel heat shock- 
inducible sequences (B7-B9) mediate protein folding and 
turnover cir pos.^ess some other biochemical activftv. Complete 
nucleotide sequence determination, conceptual 'iranslaiton 
expression momioring, and biochemical analvsts should pro- 
vide a detailed functional undersiandinc of these cenes 
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A ^^^"?J"P* Of heat .shock and phort>o( «ier- 

regulated gen« Gene expression levels per mm) mWA^U^Z) 
are shown for 15 genes (Tabic U in hum^ bone marrow (r^d)^ ^Zn 

iccordlneTr' ^^-^'^^r ^^^^ "-P^^ 

according i<) expression levels {A-C). ' 



Proc. Natl. Acad. Sci. USA 93 (1996) 

Phorbo) ester, a potent activator of protein kinase C(16 17) 
induced a set of genes distinct from those involved in the heat 

«.^!f„ ?.*J^T- """^ '"■8"y gene identified in tbis 

study, FAC-1, encodes a nuclear tyrosine Icinasc that may niav 
Mc "e"'*""8 transcription and cell cycle progression 
(la). r«i*--icBl, a second phoibol ester-inducible gene is an 
intensively studied meniber of the Rel transcription facior 
family (19-21). The Re! proteins are activated by a larar 
number of stimuli, including phoibol esten. cytokines bacte- 
nal and viral pathogens, and ultraviolet light (19-21) Modest 
activation was observed for three sequences not known to be 
induable by phorbol esters, including phosphoglycerate ki- 
nase, Prmicroglobulin, and a novel human gene (B19) Ex- 
tensive expression monitoring with microarrays should assist in 
understandtng how each of these genes integrate into jhe 
highly complex phorbol ester signaling pathway. 

It IS striking that four novel human genes were discovered 
with an array of 1000 randomly chosen clones, particularly 
because the beat shock and phorbol ester signaling pathways 
have been so intensively studied (10-21). The facile discovery 
of these sequences underscores the fact that microarrays can 
t>e used for ^ne discovery in the absence of any sequence 
information. By this approach, clones ate chosen at random 
from any library of interest and only those clones that display 
interesting expression patterns are sequenced and character- 
ized This parallel assay, coupled with a modest DNA sequenc- 
mgfachty, allows high-throughput human genome expression 
analysis and gene discovery. 

oJ^t^!*/*"".-^" activated or repressed by a given stimulus 
PiT^^f n1 '° P="hway involved 

(22-24). Detailed examination of these gene expression "sig- 
natures can provide a dynamic view of the mode of action of 
a given signaling substance (22-24). Microarrays may thus 
a^low rapid mechanistic examination of hormones, druas 
ehcitors, and other small molecules; moreover, functional 
analysis of transcription factors, kinases, growth factors cyto- 
kines, receptors, and other gene products should be possible 
Efforts arc underway to develop mRNA amplification strate- 
gies to enable probe preparation from minute tissue samples 
This capability might allow for high-throughput patient screen- 
ing in a clinical setting. © r k owtcco 

The cunent detection limit of the assay allows monitoring of 
mRNA ^Thll il/IS"^'"' -1:500.000 (wt/w,) of the total 
mKrNA. I his iQ-fold increase in sensitivity compared with (he 
original report (4) was achieved largely by modifying the 
coupling chemistry, which reduced background fluoresance 
The significance of this Improvement is considerable in that 
approximately half the human genes identified in this study 
mcluding all four novel sequences, exhibited expression levels 
below the original detection limit of 1:50,000 (4) 

.J- ^ k''7u *° "^^'f* ^-'^''^ *=f'^"S« in expression was 
achieved by the use of two-color fluorescence in the labelini; 
and detection schemes, digitized data collection, and cusioin 
software The importance of this capability is underscored by 
\he fact that nearly all of the genes examined here exhibited 
<6.fold changes m expression. The four novel genes, which 
showed s2.2-fold activation, were probably overlooked in 
previous screens that used conventional differentia) expression 
techniques. It tnay be possible to further improve the precision 
of the m.croarray assay by the use of dosely related fluorescent 
reacwSr ^ """^ '*'''""8 hybridization 

Microarrays offer a number of advantages over other po- 
.t?rt ^'l^-^^P'^'^'y approaches to expression analysis. The 
chip-based approach enables small hybridisation volumes, high 
array densities, and the use of fluorescence labeling and 

."t" ''^^ unattainable with filter-based 
approaches f25. 26). fhe use of cDNA clones provides hy- 
bridizaiion specificity that is not readily attained with olitto- 
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nucleotide arrays (27^30). The parallel format of the assay 
provides a simultaneous differential expression readout for 
>1000 genes. This contrasts with sequencing^based methods^ 
which require serial data collection for oppression analysis (3li 
32). A commercial source of cDNA coicroarrays would greatly 
speed the use of a chip-based approach lo expression analysis 
The availability of Urge numbers of ESTs (3) provides a rich 

^^nJI^L'^^ ""^^^ ^o'' roicroarraying. The 

>400,000 ESTs in the pubhc data bases represent a significant 
?f.^^^ ^^^^^ S*^"=« ^3). Microarrays of thousands of 
£5Ts will provide a powerful analytical tool for future human 
gene expression studies. The 100.000 genes in the human 
genome (2, 33) emphasite the need for microarrays of greater 
density. Attempts to iojprove raicrodeposition techniques are 
underway and should allow construction of arrays containing 
a complete jef of human gene targets (htlp://cmgm.stanfor£l 
edu/-8chena/). Microarrays of 100.000 cDNA elements 
would allow expression moniioring of the entire human ge- 
nome in a single hybridization. This capacity, coupled with 
detailed biochemical analysis of the individual gene products 
would greatly speed the functional analysis of the human 
genome. 

We thank; S, EUedgc (seiiedgecabcm.nnc.edu) for the human cDNA 
Ubrary, Qiagen representatives for help withplasmid puhficatioo. and 
A. J. Smith and colleagues at the Protein and Nucleic Acid fPAN) 
facility (Stanford) for oligonucleotide synthesis and DNA sequendng 
We abo thank members of the Davis, Brown, and Smith laboraioriw 
for cnttcaJ commenU: and helpful discussions and Sytlteni employees 
for technical assistance. Support for R.W.D. was provided by the 
't^w^^fl^i*,"^^*'""^^""" (MCB910601I) and National Institutes 
H ".'^^^l^/i^SS?^^^ ^ National lostiiuies of 

Health (3R21HG00450) and Howard Hughes Medical Institute 
r.O.B. IS an assistant investigator of the Howard Hughes Medical 
Institute. ^ 
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Expression monitoring by hybridization to 
high-density oligonucleotide arrays 
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The human genome encodes approximately 100,000 different genes, and at least partial sequence 
Information for nearly al[ wilt be available soon. Sequence Infonnation alone, however, is insufficient for a 
full understanding of gene function, expression, regulation, and spMce-sUe variation. Because cellular 
processes are governed by the repertoire of expressed genes, and the levels and timing of expression, 
Is Important to have experimental tools for the direct monitoring of large numbers of mRNAs in parallel 
We have developed an approach that is based on hybridization to small, high-density arrays containing 
tens of thousands of synthetic oligonucleotides. The arrays are designed based on sequence information 
alone and are synthesized in situ using a combination of photolithography and oligonucleotide chemistry. 
RNAs present at a frequency of 1 :300,0Q0 are unambiguously detected, and detection Is quantitative over 
more than three orders of magnitude, Thia approach provides a way to use directly the growing body of 
sequence information for highly parallel experimental Investigations. Because of the combinatoria) 
nature of the chemistry and the ability to synthesize small arrays containing hundreds of thousands of 
specifically chosen oligonucleotides, the method is readily scalable to the simultaneous monitoring 
of tens of thousands of genes. 

Keywords: gene expression* mRNA abundance^ parallel, quantitative, hybridization, combinatorial synthesis, 

oligonucleotide arrays, DNA chips 



Only a subset of all encoded genes is expressed in any given cell, 
ind the levels and timing of expression govern cellular develop- 
nent, differentiation, function, and physiology. We describe a 
nethod for the simultaneous monitoring of the expression levels of 
many genes in parallel. The approach involves the determination 
of the relative concentrations of mRNAs based on hybriditation of 
entire mRNA populations to high-density arrays of synthetic 
oligonucleotides. The arrays used contain either 16^000 or more 
than 65,000 different 20-mer oligonucleotides of defined sequence 
in an area of 1.6 cm-. The surface-bound oligonucleotides are 
referred to as probes because they serve to probe, or interrogate, 
the composition of the RNA population. The DNA probes are 
synthesized directly on derivatized glass slides using a combination 
of photolithography and oligonucleotide chemistry'"*, and each of 
the probe sequences can be independently specified. The arrays 
arc made in large numbers in a controlled environment and are 
highly reproducible. 

Various methods are available for detecting or quantitating 
mRNAs. Methods such as Northern blots and nuclease protection 
have the disadvantage of being inherently serial, involving measur- 
ing a single mRNA ai a time, and of being difficult to automate. 
Differential display"* of amplified subsets of RNAs on a sequencing 
gel allows a broad search for expression differences, but the results 
generally are not quantitative, false posiiives are common, and 
characterization of positives requires additional cloning and sequenc- 
ing. Sequencing of cDNA libraries" ' is a more direct approach, but 
requires 3 great deal of sequencing and is not sensitive to the presence 
of less abundant messages. The Serial Analysis of Gene Expression 



(SAGE) method of Velculescu et al' is a clever and efficient varia- 
tion of the cDNA sequencing approach. The method involves fairly 
complicated sample preparation procedures, still requires a large 
amount of sequencing, and tends to be laborious and not particu- 
larly sensitive. 

Array-based methods that involve the spotting of multiple 
clones or cDNAs onto nylon membranes'^-'' or modified glass 
microscope slides"* have been described. These approaches have 
the inherent advantages of being parallel with a direct and rapid 
readout of the hybridization results. The disadvantages are that 
in order to monitor many genes, a large number of cDNAs or 
PCR products must be prepared, purified, quantitated, cata- 
logued, and spotted onto a solid support. If the cDNAs are 
derived from a cDNA library^ low abundance cDNAs are unlikelv 
to be spotted and the library must be normdized to reduce the 
redundant spotting of cDNAs from highly expressed genes. One 
potential advantage of using spotted cDNAs is that they might be 
expected to hybridize with greater affinity and specificity than 
shorter oligonucleotide probes- However* at present this does not 
appear to be the case. In the method of Schena et ah'** and Shalon 
et aI.'^ this may be because the spotted cDNAs are attached to the 
surface of poiylysine-coated glass slides through a large number 
of interactions between the negatively charged phosphate back- 
bone and the positively charged surface-bound lysine sidechains. 
These interactions tend to reduce the conformational freedom of 
the bound cDNAs and hence their affinity for complementarv 
molecules in solution. 

Our approach differs from spotting methods in that mRNA 
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The development of VLSIPS^* technology has provided 
methods foir making very large arrays of oligonucleotide pro): 
in very small arrays. See U.S. Patent No. 5^143,854 and PCI 
patent publioation Nos. WO 90/15070 and 92/10092, &ach of 
5 vhich is incorporated herein by reference. U-S. Patent 

application Serial No, 052,937, filed June 25, 19SJ3, descrii 
methods for making arrays of oligonucleotide probes that car 
be used to provide the complete sequence of a target nucleic 
acid and to detect the preseTice of a nucleic acid containing 

10 specific nucleotide sequence- 
Hi cr of abri<?ated arrays of large numbers of 
oligonucleotide probas, called ^DUA chips" offer great proiui 
for a wide variety of applications • Wew methods and reagent 
are required to realize this promise, and the present 

15 invention helps meet that need, 

SUMMARY OF THE INTVENTION 
TJie invention provides several stirategies employing 
immobilized arrays of probes for comparing a reference 
sequence of Jcnown sequence with a target sequence showing 

20 substantial similarity with the reference sequence^ but 

differing in the presence of, e,g. , mutations. In a firsit 
embodiment, the invention provides a tiling strategy employi 
an array of immobilized oligonucleotide probes comprising at 
least two sets of probes* A first probe set comprises a 

25 plurality of probes, each probe comprising a segment of at 

least three nucleotides exactly complementary to a subsequer 
of the reference sequence # the segment including at least or 
interrogation position complementary to a corrasponding 
nucleotide in the reference sequence. A second probe set 

3 0 comprises a corresponding probe for ©ac^i probe in the first 
probe set^ the corresponding probe in the second probe set 
being identical to a sequence comprising the corresponding 
probe from the first probe &et or a subsequence of at least 
three nucleotides thereof that includes the at least one 

3 5 interrogation position, except that the at least one 
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